Hierarchical Multiresolution Models for fast Object Detection
Authors
Abstract
The ability to automatically detect and recognize objects in unconstrained images is becoming increasingly important. From security systems and robots to smartphones and augmented reality, every intelligent device needs to know the semantic meaning of an image. This thesis tackles the problem of fast object detection based on template models. Searching for an object in an image means evaluating the similarity between the template model and every possible image location and scale. We argue that a template model representation based on a hierarchy of multiple resolutions is an optimal choice that can lead to excellent detection accuracy and fast computation. Because the search is already performed at multiple image resolutions in order to detect objects at multiple scales, a template model with multiple resolutions improves the model representation at almost no additional computational cost. The hierarchy of resolutions also adapts naturally to a search over image resolutions, from coarse to fine. This yields a double speed-up: the initial set of coarse locations to search is reduced, and the cost of evaluating the template model is lower. The search over resolutions can be performed with a cascade of multiresolution classifiers, which saves computation by stopping the search early at the coarse level when it encounters easy negative examples. An alternative approach is to select, locally but uniformly, the most promising detection locations at the coarse level and then iteratively propagate only these to the finer resolutions. This procedure, which we call coarse-to-fine search, achieves a speed-up similar to that of the multiresolution cascade, but its computational time is independent of the image content. The coarse-to-fine search is then extended to deformable part models.
In this approach, as the model resolution increases, the hierarchy of models is recursively split into deformable subparts. Each part can thus be aligned to the object in the image, producing a better representation and therefore improved detection accuracy, still at a reduced computational cost. We validate the different multiresolution models on several commonly used datasets, showing state-of-the-art results with a reduced computational cost. Finally, we specialize the multiresolution deformable model to the challenging task of pedestrian detection from moving vehicles, which requires both high accuracy and real-time performance. We show that the overall quality of our model is superior to that of previous work and that it can lead to the first reliable pedestrian detection based only on images.
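The coarse-to-fine search described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: it assumes precomputed per-level score maps (nested lists, each level doubling the resolution of the previous one), a hypothetical function name `coarse_to_fine_search`, a hypothetical `block` neighborhood size, and a simple 2x2 child neighborhood at each finer level. A real detector would compute the fine scores lazily, only at the propagated locations.

```python
def coarse_to_fine_search(score_levels, block=2):
    """Illustrative coarse-to-fine search over precomputed score maps.

    score_levels: list of 2D lists ordered coarse to fine; each level is
    assumed to have twice the height and width of the previous one.
    block: side of the coarse neighborhood in which one location is kept.
    Returns (hypotheses, fine_evaluations), where hypotheses is a list of
    (total_score, y, x) at the finest level, best first.
    """
    coarse = score_levels[0]
    h, w = len(coarse), len(coarse[0])

    # Step 1: local but uniform selection. Keep the best-scoring location
    # in every block x block neighborhood of the coarse map.
    hypotheses = []
    for by in range(0, h, block):
        for bx in range(0, w, block):
            best = max(
                (coarse[y][x], y, x)
                for y in range(by, min(by + block, h))
                for x in range(bx, min(bx + block, w))
            )
            hypotheses.append(best)

    # Step 2: propagate only the kept locations to each finer level,
    # scoring the 2x2 children of every hypothesis and accumulating scores.
    fine_evaluations = 0
    for fine in score_levels[1:]:
        refined = []
        for score, y, x in hypotheses:
            children = [
                (score + fine[2 * y + dy][2 * x + dx], 2 * y + dy, 2 * x + dx)
                for dy in (0, 1)
                for dx in (0, 1)
            ]
            fine_evaluations += len(children)
            refined.append(max(children))
        hypotheses = refined

    return sorted(hypotheses, reverse=True), fine_evaluations
```

In this sketch the number of fine-level evaluations depends only on the map sizes and on `block`, not on the score values, which mirrors the abstract's point that the running time is independent of the image content. A cascade, by contrast, would drop hypotheses whose coarse score falls below a threshold, so its cost would vary with the image.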
Similar resources
Automatic Road Detection and Extraction From MultiSpectral Images Using a New Hierarchical Object-based Method
Road detection and extraction is one of the most important problems in photogrammetry, remote sensing and machine vision. A great deal of research in this area has been based on multispectral images, mostly with relatively good results. In this paper, a novel automated, hierarchical object-based method for detecting and extracting roads is proposed. This research is based on the M...
Multiresolution Models for Object Detection
Most current approaches to recognition aim to be scale-invariant. However, the cues available for recognizing a 300 pixel tall object are qualitatively different from those for recognizing a 3 pixel tall object. We argue that for sensors with finite resolution, one should instead use scale-variant, or multiresolution representations that adapt in complexity to the size of a putative detection wi...
A Real Time Adaptive Multiresolution Adaptive Wiener Filter Based On Adaptive Neuro-Fuzzy Inference System And Fuzzy evaluation
In this paper, a real-time denoising filter based on modelling of stable hybrid models is presented. The hybrid models are composed of the shearlet filter and the adaptive Wiener filter in different forms. The optimization of various models is accomplished by the genetic algorithm. Next, regarding the significant relationship between Optimal models and input images, changing the structure of Optim...
Multiresolution Implicit Representation of 3D Objects
This paper presents a generic technique for converting any 3D object into an implicit representation, based on multiresolution implicit grids. A specific hash table structure is proposed, which permits compact storage as well as easy hierarchical evaluation of the object. Finally, a tessellator for these multiresolution implicit objects is presented, that takes advantage of the hash table struc...
Introducing Multiresolution Representation in Deformable Object Modeling
The need to simulate the behaviour of biological tissues in virtual environments originated intense research to devise techniques for physically-based modeling of objects whose shape and topology can dynamically evolve. The relevant amount of computation required by the existing techniques and the growing complexity of the scenes to be simulated impose the study of more flexible methods that sh...
Publication date: 2012